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Abstract. Many a time an objective of a statistical analysis is to estimate a limit value like 3-sigma 95% 
confidence upper limit from a data sample. The generalized Extreme Value Distribution method can be 
profitably employed in many situations for such an estimate. ' 

It is well known that according tojthe Central Limit theorem the mean value of a large data set is normally 
distributed irrespective of the distribution of the data from which the mean value is derived. In a somewhat 
similar fashion it is observed that many times the extreme value of a data set has a distribution that can be 
formulated with a Generalized Distribution. 

For a set of data Xi,x 2> ... J.,x n , let the mean value be denoted by X m where 

Xm = (X\+x 2 +..+...+..+xJ/n 

Let 2 = (X^,- ji) / cWn) where jj., cr 2 are population mean and variance. 

Central limit theorem states that for large n, Z is normally distributed with zero mean and unit variance. 
Similarly in many situations the extreme value of a large data set has a Generalized Extreme value 
distribution given by 

G(z) = exp Hl+5((z-n)/a)]- ,/4 }, 

where £ is a parameter that together with p and a specifies distribution characteristics. 

In space shuttle entry with 3-string GPS navigation the Figure Of Merit (FOM) value gives a measure of 
GPS navigated state accuracy. A GPS navigated state with FOM of 6 or higher is deemed unacceptable and 
is said to form a FOM 6 or higher| chimney. A FOM chimney is a period of tune during which the FOM 
value stays higher than 5. A longer period of FOM of value 6 or higher causes navigated state to 
accumulate more error for a lack of state update. For an acceptable landing it is imperative that the state 
error remains low and hence at low altitude during entry GPS data of FOM greater than 5 must not last 
more than 138 seconds. 

To test the GPS performance many entry test cases were simulated at the Avionics Development 
Laboratory. Only high value FOM chimneys are consequential. The extreme value statistical technique is 
applied to analyze high value FOM chimneys. The Maximum likelihood method is used to determine 
parameters that characterize the GEV distribution. And then the limit value statistics are estimated. 
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GPS FOM Chimney Analysis using Generalized 
Extreme Value Distribution 


■ Rick Ott, Joe Frisbee and Kanan Saha 
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STPOR Recovery Data 

• Data: One entire small set, n = 60 (59) 

• Data does NOT pass Normal QQ Regression Test 
r = .949377, p-value < 1 % (See Advanced Statistics: Monte 

Carlo, Bootstrapping, Multivariate Methods, 1991 C. Hallum & R. Chhikara) 

• Since complete set is given, the first method 
chosen is a Generalized Pareto Threshold Model 

• Threshold Chosen from Mean Residual Life Plot 
and Parameter Threshold Plots (if possible) 

• Generalized Pareto Threshold Model is 
Nonparametric, Very Conservative, No 
Assumptions besides Max of F(x) follows a 
Generalized Extreme Value Distribution! 
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STPOR Recovery Mean Residual Life Plot 

(w/ 289) 
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STPOR Recovery Mean Residual Life Plot 

(wo/ 289) 
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Mean Residual Life Plots 

• Without 289 MRLP suggest threshold u « 70 

• With 289 MRLP plot severely influenced by 
suspected 289 outlier - No threshold suggested 

• PROBLEM: Even if 70 is used as threshold 
only 12(11) points used for parameter 
estimation. Without 289 Data Set, software 
does not converge to give estimates if 
threshold above 63 . 
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Parameter Threshold Plot wo 289 




Threshold 


FROM : UNITED SPPCE PLL I PNCE 281 212 6326 2004,01-12 16:13 #273 P. 11/21 


JAN 12 ’04 18=12 281 212 6326 PAGE. 12 


STPOR Recovery GPD 
(Generalized Pareto Distribution) 
Estimates and Standard Errors (w/ 289) 

Threshold 63 


Number of points used in estimation: 28 out of 60 
Max Negative log-likelihood: 107. 126 
mles: a* = 9.7694690 £=0.5466667 
se_* = 3.2344973 se, = 0.2908163 
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Threshold 63 w 289 Diagnostic Plots 
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STPOR Recovery GPD 
£, Profile Log-Likelihood (w/ 289, u = 63) 
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STPOR Recovery GPD 99.865%tile 
Profile Log-Likelihood (w/ 289, u = 63) 
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Profile Log-likelihood 
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STPOR Recovery GPD 99.865%tile 
Profile Log-Likelihood (w/ 289, u = 70) 
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Better Cl for 99.865%tile 
(100,10000) with higher threshold 
as MRLP suggested - Still 
expected with one suspect outlier, 
bad GPD fit. and so few points 
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i 

STPOR Recovery GEV 
Estimates and Standard Errors (wo/ 289) 

Threshold 63 

Number of points used in estimation: 27 out of 60 
Max Negative log-likelihood:96. 18224 
mles: a* - 15.9403808 ^ = -0.2065505 
se a » = 10.0720236 se^ = 0.6031458 
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Threshold 63 wo 289 Diagnostic Plots 


Probability Plot 



Return Level Plot 
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Profile Log-likelihood 


STPOR Recovery GPD 
^ Profile Log-Likelihood (wo/ 289, u = 63) 
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MLE Shows Short Tail but Cl 
includes 0, thus maybe long, 
medium, or short tailed - 


Expected with such few points 
in total data set 
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STPOR Recovery GPD 99.865%tile 
Profile Log-Likelihood (wo/ 289, u = 63) 



\ Better Cl for 99.865%tile 

(100,950) - Expected with 
\ no suspect outliers, better 

■ x GPD fit, but so few points 
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Conclusions 

• Data does NOT pass Normality Test 

• Large Cl for 99.865%tile due to lack of data and 
nonparametric GPD method 

• Next to Come: 

- Weissman Estimates based on assuming a specific GEV 
family for k largest statistics 

- Extreme Spacing Test to Reject Medium Tail 
Distribution in Favor of Large or Short Tail 
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